Chemical Space
   HOME

TheInfoList



OR:

Chemical space is a concept in
cheminformatics Cheminformatics (also known as chemoinformatics) refers to use of physical chemistry theory with computer and information science techniques—so called "''in silico''" techniques—in application to a range of descriptive and prescriptive problem ...
referring to the property space spanned by all possible
molecule A molecule is a group of two or more atoms held together by attractive forces known as chemical bonds; depending on context, the term may or may not include ions which satisfy this criterion. In quantum physics, organic chemistry, and bioch ...
s and
chemical compound A chemical compound is a chemical substance composed of many identical molecules (or molecular entities) containing atoms from more than one chemical element held together by chemical bonds. A molecule consisting of atoms of only one element ...
s adhering to a given set of construction principles and boundary conditions. It contains millions of compounds which are readily accessible and available to researchers. It is a library used in the method of molecular docking.


Theoretical spaces

A chemical space often referred to in cheminformatics is that of potential pharmacologically active molecules. Its size is estimated to be in the order of 1060 molecules. There are no rigorous methods for determining the precise size of this space. The assumptions used for estimating the number of potential pharmacologically active molecules, however, use the Lipinski rules, in particular the molecular weight limit of 500. The estimate also restricts the chemical elements used to be Carbon, Hydrogen, Oxygen, Nitrogen and Sulfur. It further makes the assumption of a maximum of 30 atoms to stay below 500 Daltons, allows for branching and a maximum of 4
rings Ring may refer to: * Ring (jewellery), a round band, usually made of metal, worn as ornamental jewelry * To make a sound with a bell, and the sound made by a bell :(hence) to initiate a telephone connection Arts, entertainment and media Film and ...
and arrives at an estimate of 1063. This number is often misquoted in subsequent publications to be the estimated size of the whole organic chemistry space, which would be much larger if including the halogens and other elements. In addition to the drug-like space and lead-like space that are, in part, defined by the Lipinski's rule of five, the concept of known drug space (KDS), which is defined by the molecular descriptors of marketed drugs, has also been introduced. KDS can be used to help predict the boundaries of chemical spaces for drug development by comparing the structure of the molecules that are undergoing design and synthesis to the molecular descriptor parameters that are defined by the KDS.


Empirical spaces

As of July 2009, there were 49,037,297 organic and inorganic substances registered with the
Chemical Abstracts Service CAS (formerly Chemical Abstracts Service) is a division of the American Chemical Society. It is a source of chemical information. CAS is located in Columbus, Ohio, United States. Print periodicals ''Chemical Abstracts'' is a periodical index tha ...
, indicating that they have been reported in the
scientific literature : ''For a broader class of literature, see Academic publishing.'' Scientific literature comprises scholarly publications that report original empirical and theoretical work in the natural and social sciences. Within an academic field, scient ...
.
Chemical libraries A chemical substance is a form of matter having constant chemical composition and characteristic properties. Some references add that chemical substance cannot be separated into its constituent elements by physical separation methods, i.e., wi ...
used for laboratory-based screening for compounds with desired properties are examples for real-world chemical libraries of small size (a few hundred to hundreds of thousands of molecules).


Generation

Systematic exploration of chemical space is possible by creating ''
in silico In biology and other experimental sciences, an ''in silico'' experiment is one performed on computer or via computer simulation. The phrase is pseudo-Latin for 'in silicon' (correct la, in silicio), referring to silicon in computer chips. It ...
'' databases of virtual molecules, which can be visualized by projecting multidimensional property space of molecules in lower dimensions.{{cite journal , author1=L. Ruddigkeit , author2=L. C. Blum , author3=J.-L. Reymond , s2cid=18531792 , journal = J. Chem. Inf. Model. , year = 2013 , volume = 53 , issue = 1 , pages = 56–65 , doi = 10.1021/ci300535x , title = Visualization and Virtual Screening of the Chemical Universe Database GDB-17 , pmid = 23259841 Generation of chemical spaces may involve creating
stoichiometric Stoichiometry refers to the relationship between the quantities of reactants and products before, during, and following chemical reactions. Stoichiometry is founded on the law of conservation of mass where the total mass of the reactants equal ...
combinations of
electrons The electron ( or ) is a subatomic particle with a negative one elementary electric charge. Electrons belong to the first generation of the lepton particle family, and are generally thought to be elementary particles because they have no ...
and
atomic nuclei The atomic nucleus is the small, dense region consisting of protons and neutrons at the center of an atom, discovered in 1911 by Ernest Rutherford based on the 1909 Geiger–Marsden gold foil experiment. After the discovery of the neutron ...
to yield all possible topology isomers for the given construction principles. In
Cheminformatics Cheminformatics (also known as chemoinformatics) refers to use of physical chemistry theory with computer and information science techniques—so called "''in silico''" techniques—in application to a range of descriptive and prescriptive problem ...
, software programs called Structure Generators are used to generate the set of all chemical structure adhering to given boundary conditions. Constitutional Isomer Generators, for example, can generate all possible constitutional isomers of a given molecular gross formula. In the real world,
chemical reaction A chemical reaction is a process that leads to the IUPAC nomenclature for organic transformations, chemical transformation of one set of chemical substances to another. Classically, chemical reactions encompass changes that only involve the pos ...
s allow us to move in chemical space. The mapping between chemical space and molecular properties is often not unique, meaning that there can be very different molecules exhibiting very similar properties. Materials design and
drug discovery In the fields of medicine, biotechnology and pharmacology, drug discovery is the process by which new candidate medications are discovered. Historically, drugs were discovered by identifying the active ingredient from traditional remedies or by ...
both involve the exploration of chemical space.


See also

*
Cheminformatics Cheminformatics (also known as chemoinformatics) refers to use of physical chemistry theory with computer and information science techniques—so called "''in silico''" techniques—in application to a range of descriptive and prescriptive problem ...
*
Drug design Drug design, often referred to as rational drug design or simply rational design, is the inventive process of finding new medications based on the knowledge of a biological target. The drug is most commonly an organic small molecule that acti ...
*
Sequence space (evolution) In evolutionary biology, sequence space is a way of representing all possible sequences (for a protein, gene or genome). The sequence space has one dimension per amino acid or nucleotide in the sequence leading to highly dimensional spaces. M ...


References

Cheminformatics Computational chemistry